Cost- and Energy-Aware Load Distribution Across Data Centers
نویسندگان
چکیده
Today, many large organizations operate multiple data centers. The reasons for this include natural business distribution, the need for high availability and disaster tolerance, the sheer size of their computational infrastructure, and/or the desire to provide uniform access times to the infrastructure from widely distributed client sites. Regardless of the reason, these organizations consume significant amounts of energy and this energy consumption has both a financial and environmental cost. Interestingly, the geographical distribution of the data centers often exposes many opportunities for optimizing energy consumption and costs by intelligently distributing the computational workload. We are interested in three such opportunities. First, we seek to exploit data centers that pay different and perhaps variable electricity prices. In fact, many power utilities now allow consumers to choose hourly pricing, e.g. [1]. Second, we seek to exploit data centers that are located in different time zones, which adds an extra component to price variability. For example, one data center may be under peakdemand prices while others are under off-peak-demand prices. Third, we seek to exploit data centers located near sites that produce renewable (hereafter called “green”) electricity to reduce “brown” energy consumption that is mostly produced by carbon-intensive means, such as coal-fired power plants. To make our investigation of these degrees of freedom more concrete, in this paper we consider multi-datacenter Internet services, such as Google or iTunes. These services place their data centers behind a set of front-end devices. The front-ends are responsible for inspecting each client request and forwarding it to one of the data centers that can serve it, according to a request distribution policy. Despite their wide-area distribution of requests, services must strive not to violate their servicelevel agreements (SLAs). This paper proposes and evaluates a framework for optimization-based request distribution. The framework enables services to manage their energy consumption and costs, while respecting their SLAs. It also allows services to take full advantage of the degrees of freedom mentioned above. Based on the framework, we propose two request distribution policies. For comparison, we also propose a greedy heuristic designed with the same goals and constraints as the other policies. Operationally, an optimization-based policy defines the fraction of the clients’ requests that should be directed to each data center. The front-ends periodically (e.g., once per hour) solve the optimization problem defined by the policy. After fractions are computed, the front-ends abide by them until they are recomputed. The heuristic policy operates quite differently. During each hour, it first exploits the data centers with the best power efficiency, and then starts exploiting the data centers with the cheapest electricity. Our evaluation uses a day-long trace from a commercial service. Our results show that the optimizationbased policies can accrue substantial cost reductions by intelligently leveraging time zones and hourly electricity prices. The results also show that we can exploit green energy to achieve significant reductions in brown energy consumption for small increases in cost.
منابع مشابه
Energy Aware Resource Management of Cloud Data Centers
Cloud Computing, the long-held dream of computing as a utility, has the potential to transform a large part of the IT industry, making software even more attractive as a service and shaping the way IT hardware is designed and purchased. Virtualization technology forms a key concept for new cloud computing architectures. The data centers are used to provide cloud services burdening a significant...
متن کاملCost-Aware Virtual Machine Placement in Cloud Computing Systems
Cloud service providers operate multiple geographically distributed data centers. These data centers consume huge amounts of energy, which translate into high operating costs. Interestingly, the geographical distribution of the data centers provides many opportunities for cost savings. For example, the electricity prices and outside temperatures may be widely different at the data centers. This...
متن کاملRenewable-aware geographical load balancing of web applications for sustainable data centers
The ever-increasing demand for web applications deployed across multiple data centers results in large electricity costs for service providers and significant impact on the environment. This has motivated service providers to move towards more sustainable data centers powered by renewable or green sources of energy, such as solar or wind. However, efficient utilization of green energy to servic...
متن کاملAdaptive Dynamic Data Placement Algorithm for Hadoop in Heterogeneous Environments
Hadoop MapReduce framework is an important distributed processing model for large-scale data intensive applications. The current Hadoop and the existing Hadoop distributed file system’s rack-aware data placement strategy in MapReduce in the homogeneous Hadoop cluster assume that each node in a cluster has the same computing capacity and a same workload is assigned to each node. Default Hadoop d...
متن کاملAnalyzing the Cooling Efficiency of the Thermal Environment in Data Centers
Given the explosive growth of data, scalability and fault tolerance have become a fundamental challenge for data center network structures. Temperature in data centers significantly affects the failure ratio of high-speed network devices. Various types of air distribution schemes influence the temperature of network equipment differently, and the cooling cost in data centers dominates the overa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009